UNCERTAINTY PRINCIPLES FOR ORTHONORMAL SEQUENCES 



PHILIPPE JAMING AND ALEXANDER M. POWELL 



Abstract. The aim of this paper is to provide complementary quantitative extensions of 
two results of H.S. Shapiro on the time-frequency concentration of orthonormal sequences 
in L 2 (R). More precisely, Shapiro proved that if the elements of an orthonormal sequence 
and their Fourier transforms are all pointwise bounded by a fixed function in L 2 (R) then 
the sequence is finite. In a related result, Shapiro also proved that if the elements of an 
orthonormal sequence and their Fourier transforms have uniformly bounded means and 
dispersions then the sequence is finite. 

This paper gives quantitative bounds on the size of the finite orthonormal sequences 
in Shapiro's uncertainty principles. The bounds are obtained by using prolate spheroidal 
wave functions and combinatorial estimates on the number of elements in a spherical code. 
Extensions for Riesz bases and different measures of time-frequency concentration are also 
given. 



The uncertainty principle in harmonic analysis is a class of theorems which state that a 
nontrivial function and its Fourier transform can not both be too sharply localized. For 
background on different appropriate notions of localization and an overview on the recent 
renewed interest in mathematical formulations of the uncertainty principle, see the survey 
|FSj . This paper will adopt the broader view that the uncertainty principle can be seen not 
only as a statement about the time-frequency localization of a single function but also as a 
statement on the degradation of localization when one considers successive elements of an 
orthonormal basis. In particular, the results that we consider show that the elements of an 
orthonormal basis as well as their Fourier transforms can not be uniformly concentrated in 
the time-frequency plane. 

Hardy's Uncertainty Principle [H] may be viewed as an early theorem of this type. To set 
notation, define the Fourier transform of / 6 L 1 (]R) by 



and then extend to L 2 (M) in the usual way. 
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Theorem 1.1 (Hardy's Uncertainty Principle). Let a,b,C,N > be positive real numbers 
and let f G L 2 (R). Assume that for almost every i^el, 

(1.1) |/(x)| < C(l + \x\) N e-™W 2 and |/(0| < C(l + Kl^e"^ 2 . 

T/ie following hold: 

• If ab> 1 then f = 0. 

• Ifab=l then f(x) = P{x)e~ 7 " 1 \ x \ for some polynomial P of degree at most N. 

This theorem has been further generalized where the pointwise condition is replaced 
by integral conditions in BDJ j, and by distributional conditions in [D]. Also see |GZj and 
|HL| . One may interpret Hardy's theorem by saying that the set of functions which, along 
their Fourier transforms, is bounded by C(l + IxD^e -71 "^' is finite dimensional, in the sense 
that its span is a finite dimensional subspace of I? (R) . 

In the case ab < 1, the class of functions satisfying the condition has been fully 

described by B. Demange D . In particular, it is an infinite dimensional subset of L 2 (R). 
Nevertheless, it can not contain an infinite orthonormal sequence. Indeed, this was first 
proved by Shapiro in |S1| : 

Theorem 1.2 (Shapiro's Umbrella Theorem). Let tp,tp G L 2 (R). If {e k } C L 2 (R) is an 
orthonormal sequence of functions such that for all k and for almost all 

\ek(x)\ < \<p(x)\ and \e k (0\ < \ip(0l 
then the sequence {ek} is finite. 

Recent work of A. De Roton, B. Saffari, H.S. Shapiro, G. Tennenbaum, see |DSST| . shows 
that the assumption <p,ip G L 2 (IR) can not be substantially weakened. Shapiro's elegant proof 
of Theorem 11.21 uses a compactness argument of Kolmogorov, see |S2j , but does not give a 
bound on the number of elements in the finite sequence. 

A second problem of a similar nature studied by Shapiro in Sl j is that of bounding the 
means and variances of orthonormal sequences. For / G L 2 (M) with ||/|| 2 = 1, we define the 
following associated mean 

M = Mean(|/| 2 ) = J t\f{t)\ 2 dt, 

and the associated variance 

A 2 (/)=Var(|/| 2 ) = J \t-^f)\ 2 \f(t)\ 2 dt. 

It will be convenient to work also with the dispersion A(/) = y A 2 (/). In |Slj . Shapiro posed 
the question of determining for which sequences of real numbers {a n }^L , {b n }^ =0 , {c n }^ =0 , 
{d n }^ =0 C R there exists an orthonormal basis {e n }^L for L 2 (R) such that for all n > 

fj,(e n ) = a n , (iffin) = b n , A (en) = c n , A(e n ) = d n . 

Using again Kolmogorov's compactness argument, he proved the following, |Slj : 

Theorem 1.3 (Shapiro's Mean-Dispersion Principle). There does not exist an infinite or- 
thonormal sequence {e n }^L C L 2 (R) such that all four of yu(e n ), n(e n ), A(e n ),A(e n ) are 
uniformly bounded. 
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An extension of this theorem in [F] shows that if {e n }^L is an orthonormal basis for L 2 (IR) 
then two dispersions and one mean A(e n ), A(e n ), /x(e n ) can not all be uniformly bounded. 
Shapiro recently pointed out a nice alternate proof of this result using the Kolmogorov com- 
pactness theorem from |Slj . The case for two means and one dispersion is different. In fact, 
it is possible to construct an orthonormal basis {e n }^L for L 2 (M.) such that the two means 
and one dispersion fi(e n ), fi(e n ),A(e n ) are uniformly bounded, see [F] . 

Although our focus will be on Shapiro's theorems, let us also briefly refer the reader to 
some other work in the literature concerning uncertainty principles for bases. The classical 
Balian-Low theorem states that if a set of lattice coherent states forms an orthonormal basis 
for L 2 (M) then the window function satisfies a strong version of the uncertainty principle, 
e.g., see |CP1 IGHHKj . For an analogue concerning dyadic orthonormal wavelets, see |Baj . 

Overview and main results. The goal of this paper is to provide quantitative versions of 
Shapiro's Mean-Dispersion Principle and Umbrella Theorem, i.e., Theorems 11.21 and 1 1 . 31 

Section [2 addresses the Mean-Dispersion Theorem. The main results of this section are 
contained in Section 12.31 where we prove a sharp quantitative version of Shapiro's Mean- 
Dispersion Principle. This result is sharp, but the method of proof is not easily applicable 
to more general versions of the problem. Sections 12.11 and 12.21 respectively contain necessary 
background on Hermite functions and the Rayleigh-Ritz technique which is needed in the 
proofs. Section [2.41 proves a version of the mean-dispersion theorem for Riesz bases. 

Section |31 addresses the Umbrella Theorem and variants of the Mean-Dispersion Theorem. 
The main results of this section are contained in Section [3.41 where we prove a quantitative 
version of the Mean-Dispersion Principle for a generalized notion of dispersion, and in Section 
13.51 where we prove a quantitative version of Shapiro's Umbrella Theorem. Explicit bounds 
on the size of possible orthonormal sequences are given in particular cases. Since the methods 
of Section El are no longer easily applicable here, we adopt an approach based on geometric 
combinatorics. Our results use estimates on the size of spherical codes, and the theory of 
prolate spheroidal wavefunctions. Section \'A . II contains background results on spherical codes, 
including the Delsarte, Goethals, Seidel bound. Section l3~2l proves some necessary results on 
projections of one set of orthonormal functions onto another set of orthonormal functions. 
Section T3.3I gives an overview of the prolate spheroidal wavefunctions and makes a connection 
between projections of orthonormal functions and spherical codes. Section fa. 61 concludes with 
extensions to Riesz bases. 



In this section, we use the classical Rayleigh-Ritz technique to give a quantitative version 
of Shapiro's Mean-Dispersion Theorem. We also prove that, in this sense, the Hermite basis 
is the best concentrated orthonormal basis of L 2 (M). 

2.1. The Hermite basis. Results of this section can be found in |FSj . The Hermite func- 
tions are defined by 



It is well known that the Hermite functions are eigenfunctions of the Fourier transform, satisfy 



2. Growth of means and dispersions 




hk = i k hk, and form an orthonormal basis for I? 



(M) . Let us define the Hermite operator H 
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for functions / in the Schwartz class by 
It is easy to show that 

(2k + l\ 

(2.2) Hh k = i^—)h k , 

so that H may also be seen as the densely defined, positive, self-adjoint, unbounded operator 
on L 2 (R) defined by 

Hf = Y / ^ 1 (f,h k )h k . 



2vr 

k=0 



From this, it immediately follows that, for each / in the domain of H 

(2.3) (Hfj) = f;^±l|(/,^)p = f ltl 2 im 2 dt+ f^f^fdt 

k=0 J J 

= M/) 2 ll/ll2 + A 2 (/) + M/) 2 ll/i + A 2 (/). 



2.2. The Rayleigh-Ritz Technique. The Rayleigh-Ritz technique is a useful tool for esti- 
mating eigenvalues of operators, see |RS1 Theorem XIII. 3, page 82]. 



Theorem 2.1 (The Rayleigh-Ritz Technique). Let H be a positive self-adjoint operator and 
define 

X k (H) = sup inf (Hip,ili). 

where D(H) is the domain of H . Let V be a n + 1 dimensional subspace, V C D(H), and let 
P be the orthogonal projection onto V . Let Hy = PHP and let Hy denote the restriction of 
Hy to V . Let fiQ < m < • • • < fi n be the eigenvalues of Hy Then 

Afc(i?) <Hk, k = 0, • •• ,n. 

The following corollary is a standard and useful application of the Rayleigh-Ritz technique. 
For example, |LLI Chapter 12] contains a version in the setting of Schrodinger operators. 

Corollary 2.2. Let H be a positive self-adjoint operator, and let ipo, ■ ■ ■ ,{p n be an orthonor- 
mal set of functions. Then 

n n 

(2.4) ^A fe (iI)<^(F^,^). 

fc=0 fe=0 

Proof. If some <p k ^ D(H) then positivity of H implies that (|2.4|) trivially holds since the right 
hand side of the equation would be infinite. We may thus assume that (fo, ■ ■ ■ ,(p n £ D(H). 

Define the n + 1 dimensional subspace V = span {(f k } k=0 and note that the operator Hy 
is given by the matrix M = [(Hcpj, (p k )]o<j. k < n - Let no, ■ ■ ■ , fi n be the eigenvalues of Hy, i.e., 
of the matrix M. By Theorem 12.11 

n n n 

^2HH) <J2^ = Tr ace(M) = J2(Htp k ,<p k ) 

k=0 k=0 k=0 
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which completes the proof of the corollary. □ 
2.3. The Sharp Mean-Dispersion Principle. 

Theorem 2.3 (Mean-Dispersion Principle). Let {efc}£L be any orthonormal sequence in 
L 2 (R). Then for all n>0, 

(2-5) £ (A 2 (e fc ) + A 2 (e k ) + |/,(e fc )| 2 + \»{e k )\ 2 ) > (n + ^ + 1} . 

fc=0 

Moreover, if equality holds for all < n < no, then there exists {ck}™=o C C smc/i i/iai 
|cfc| = 1 and e k = c k h k for each < k < uq. 

Proof. Since H is positive and self-adjoint, one may use Corollary 12.21 It follows from Corol- 
lary that for each n > one has 

n Oh -L 1 " 

(2-6) E^^E(^^)- 

fc=0 fc=0 

From ()2.3|) . note that since ||efe||2 = 1> 

(ffe fc , e fc ) = A 2 (e fc ) + A 2 (^) + |/u(e fc )| 2 + |n(4)| 2 - 

This completes the proof of the first part. 

Assume equality holds in Q2.5JI for all n = 0, . . . , no, in other terms that, for n = 0, . . . , no, 

2n + 1 

{Hen, e n ) = A 2 (e n ) + A 2 (e~) + |«(e n )| 2 + |u(e;)| 2 = — . 

Let us first apply Q2.3|) for / = e$: 

Yl -J— \( e o> h k)\ 2 = (He ,e ) = ^- = Y2^-\(e ,h k )\ 2 

k=0 k=0 

since 1 1 eo [ 1 2 = 1- Thus, for k > 1, one has (eo,h k ) = and hence eo = cqUq. Also 1 1 eo 1 1 2 = 1 
implies |co| = 1. Next, assume that we have proved e k = c k h k for k = 0, . . . , n — 1. Since e n 
is orthogonal to e k for k < n, one has (e n , h k ) = 0. Applying (|2.3f) for / = e n we obtain that, 

E —^-\( e n^ h k)\ 2 = Y2^^ L \( e n,hk)\ 2 = {He n ,e n ) = 1 = ]T 1 \{e n ,h k )\ 2 . 

k=n k=0 k=n 

Thus (e n , /ifc) = for k > n. It follows that e n = c n h n . □ 

Example 2.4. For a// n > 0, the Hermite functions satisfy 

2n + 1 

fi(h n ) = n{h n ) = and A 2 (h n ) = A 2 (h n ) = — . 

47T 

For comparison, let us remark that Bourgain has constructed an orthonormal basis {6 n }^ =1 
for L 2 (R), see [B[, which satisfies A 2 (6 n ) < + e and A 2 (6 ra ) < + £• However, it is 

difficult to control the growth of n(6 n ),u(6 n ) in this construction. For other bases with more 
structure, see the related work in BCGP that constructs an orthonormal basis of lattice 
coherent states {3 m , n }m,nez f° r L 2 (R) which is logarithmically close to having uniformly 
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bounded dispersions. The means (fJ>(g m ,n), ^(9m}i)) for this basis lie on a translate of the 
lattice ZxZ. 

It is interesting to note that if one takes n = in Theorem 12.31 then this yields the usual 
form of Heisenberg's uncertainty principle (see |FSj for equivalences between uncertainty 
principles with sums and products). In fact, using (|2.3j) . Theorem 12.31 also implies a more 
general version of Heisenberg's uncertainty principle that is implicit in |FSj . In particular, if 
/ G L 2 (M) with ||/|| 2 = 1 is orthogonal to ho, . . . , h n _\ then 

a 2 (/) + a 2 (J) + K/)I 2 + IMJ)I 2 >^. 

For instance, if / is odd, then / is orthogonal to ho, and fi(f) = //(/) = 0. Using the 
usual scaling trick, we thus get the well known fact that the optimal constant in Heisenberg's 
inequality, e.g., see |FSj . is given as follows 

1 II #l|2 



A(/)A(/) > 




2 in general 
2 if / is odd 



Corollary 2.5. Fix A > 0. If {e k }^ =0 C L 2 (M) is an orthonormal sequence and for k 
0, . . . , n, satisfies 

lA*(e fc )|, He k )\, A(e k ), A(e k ) < A, 

then n < 8irA 2 . 



Proof. According to Theorem 12.31 

(n + l)(2n + l) 



4(n + 1)A 2 > ^ (A 2 (e fc ) + A 2 {e k ) + \^{e k )\ 2 + \^{e k )\ 2 ) 



> 



47T 
k=0 

It follows that 2n + 1 < 16ttA 2 . □ 

This may also be stated as follows: 
Corollary 2.6. // {e k }^L C L 2 (M) is an orthonormal sequence, then for every n, 



2n + 1 

max{|^(e fc )|, \n(e k )\, A(e k ), A(e k ) : < k < n} > J — . 

2.4. An extension to Riesz bases. Recall that {xfc}^L is a Riesz basis for L 2 (IR) if there 
exists an isomorphism, U : L 2 (M) — > L 2 (M), called the orthogonalizer of {x k }^L , such that 
{Ux k }^L is an orthonormal basis for L 2 (IR). It then follows that, for every {a n }^L € £ 2 , 

2 



( 2 -7) W^^ K 

II II n=0 



2 < 



oo 
n=0 



OO 



2 n=0 

One can adapt the results of the previous sections to Riesz bases. To start, note that the 
Rayleigh-Ritz technique leads to the following, cf. |RS1 Theorem XIII. 3, page 82]: 

Lemma 2.7. Let H be a positive, self-adjoint, densely defined operator on L 2 (R) ; and let 
{x k }^L be a Riesz basis for L 2 (M) with orthonormalizer U. Then, for every n > 0, 

n n 

(2.8) < \\U\\ 2 J2( H ^k,x k ). 

k=0 k=0 
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Proof. Let us take the notations of the proof of Corollary 12.21 Write <p k = Ux k , it is then 
enough to notice that 

M = [(Hx k ,x k )} = [{HTJ- X x k , U^xk)} = [(U- 1 *HTJ- X x k ,x k )}. 

As U^ 1 *HU~ 1 is a positive operator, that the Rayleigh-Ritz theorem gives 

n n 

^{Hx^Xk) >Y J ^k(U- 1 *HU- 1 ). 

k=0 k=0 

But, 

\ k {U- l *HU- 1 ) = sup inf (U^^U^ip,^) 

tpo,- tfk-i V , e[^or--,'Pfc-i] ± ,l!i/'l|2<i 

sup inf <W"V,^"V) 

v>o,- ,<Pk-i ^e[<po,- .^fc-i] ± ,IIV'll2<i 



sup _ inf (Htp,tp\ 

¥'o,-,¥'fe-i^e[!7* ¥ ,o,-,^*¥'fc-i]- L ,ll^l|2<l N 



and, as HtMllo < lit/ 



2 ^ ^ P> 2 



XkiU-^HU- 1 ) > sup _ inf _ (H^^) = T ^X h {H). 



□ 

Adapting the proofs of the previous section, we obtain the following corollary. 
Corollary 2.8. If {^fcjfcLo * s a ^ esz basis for L 2 (R) u;zf/i orthonormalizer U then for all n, 

£ (A 2 (* fe ) + A 2 (3^) + |Mx fe )| 2 + |/^)| 2 ) > (n+ /l, ( ^2 +1) - 
iS 4tt||17|| 

Thus, for every A > 0, i/iere are at mosi 87r^4 2 1 1 1 1 2 elements of the basis {x^j^Q such that 
\/j,(e n )\, \fj,(e^)\, A(e n ), A(e^) are all bounded by A. In particular, 



1 2n + 1 

max{\jj,(x k )\, \fJ,(x k )\, A(x k ), A(x k ) : < k < n} > jj^jj- y ^ ■ 

3. Finite dimensional approximations, spherical codes and the Umbrella 

Theorem 

3.1. Spherical codes. Let IK be either R or C, and let d > 1 be a fixed integer. We equip 
K. d with the standard Euclidean scalar product and norm. We denote by the unit sphere 
of K d . 

Definition. Let A be a subset of {z £ K : \z\ < 1}. A spherical A-code is a finite subset 
V C §d such that if u, v G V and u / v then (u, v) G A. 

Let Nx(A, d) denote the maximal cardinality of a spherical A-code. This notion has been 
introduced in |DGS| in the case K = R where upper-bounds on Nr(A, d) have been obtained. 
These are important quantities in geometric combinatorics, and there is a large associated 
literature. Apart from |DGSj . the results we use can all be found in |CSj . 



Ni(a,d) 
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Our prime interest is in the quantity 

( N R ([-a,a],d), when X = R 

~ [N c ({z£C:\z\<a},d), when IK = C 

for q G (0, 1]. Of course N^(a,d) < N^(a,d). Using the standard identification of C d with 
R 2d , namely identifying Z = (x 1 + iy 1 , . . . ,x d + iya) G C d with Z = (xi,y 1 , . . . ,x d , yd) G R 2d , 
we have = Re(Z,Z') c< i. Thus iVg(a,d) < iV|(a,2d). 

In dimensions d = 1 and d = 2 one can compute the following values for N£(a,d): 
. JV|(a,l) = l 

• If < q < 1/2 then iV|(a,2) = 2 

• If cos f < q < cos ^ and 3 < iV then iV|( a , 2) = jV. 
In higher dimensions, one has the following result. 

Lemma 3.1. If < a < \ then iV|(a,d) = d. 

Proof. An orthonormal basis of K d is a spherical [— a, a]-code so that N£(a,d) > d. For 
the converse, let a < 1/d and assume towards a contradiction that wq, ■ • • , w d is a spherical 

[— a,a]-code. Indeed, let us show that wo,...,w d would be linearly independent in K d . 

d 

Suppose that ^jWj = 0, and without loss of generality that |Aj| < |Ao| for j = l,...,d. 

3=0 

d 

Then Ao||wo|| 2 = — ^j( w j' w o) so that |Ao| < |Ao|<ia. As da < 1 we get that Ao = and 

3=1 

then Xj = for all j. □ 

In general, it is difficult to compute N£(a,k). A coarse estimate using volume counting 
proceeds as follows. 

Lemma 3.2. // < a < 1 is fixed, then there exist constants < a\ < 02 and < C such 
that for all d 

^e aid < N£(a,d) < Ce a2d . 

Moreover, for a < 1/2 one has N£(a,d) < 3 d if K = R, and Nfc(a,d) < 9 d ifK = C. 

Proof. The counting argument for the upper bound proceeds as follows. Let {wj}^ =1 be a 
spherical ^4-code, with A = [—a, a] or A = {z G C : \z\ < a}. For j / k, one has 

\\wj — Wk\\ 2 = \\wj || 2 + ll^fcl) 2 + 2Re (wj,Wk) > 2 — 2a. 



So. Ilic open balls D \ « \j - — j of center Wj and radius J - - — are all disjoint and 



/l - a 

included in the ball of center and radius 1 + y — - — . Therefore 

N £d — =— < cd 1 + 
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where Cd is the volume of the unit ball in K , h = 1 if K = R and h = 2 if K = C. This 

hd 

id ■< 



gives the bound N < (l + yj j^j . Note that for a < 1/2 we get N < 3 d if K = R and 

N < 9 d if K = C. The lower bound too may be obtained by a volume counting argument, 
see |ES|. □ 

The work of Delsarte, Goethals, Seidel, 11 )QS provides a method for obtaining more refined 
estimates on the size of spherical codes. For example, taking j3 = —a in Example 4.5 of [D( 
shows that if a < A= then 



(3.9) N^(a,d)< 



a 2 )d 



a 



2 d ' 



Equality can only occur for spherical {—a, a}-codes. Also, note that if a = 1 — Jj-, then 



3.2. Approximations of orthonormal bases. We now make a connection between the 
cardinality of spherical codes and projections of orthonormal bases. 

Let H be a Hilbert space over K and let = {V'fclfcLi De an orthonormal basis for 7i. For 
an integer d > 1, let be the orthogonal projection on the span of {ipi, . . . , ipd}- For e > 0, 
we say that an element / G H is e, d-approximable if ||/ — Pd/H^ < e, and define 6 ei d to be 
the set of all of / G H with ||/||^ = 1 that are e, d-approximable. We denote by Ak(e, d) the 
maximal cardinality of an orthonormal sequence in fi e ^. 

Example 3.3. Lei {e.,}™ =1 6e t/ie canonical basis for W 1 , and let {ipj}™!^ be an orthonormal 
basis for V^, where V = span{(l, 1, . . . , 1)}. Then \\e k — P n _iefc|| 2 = holds for each 
1 < k < n, and hence n — 1) > n. 

Our interest in spherical codes stems from the following result, cf. [PJ Corollary 1]. 

Proposition 3.4. If < e < l/\/2 and a = j^r, then A^(e,d) < N^(a,d). 

Proof. Let {il>k}kLi be an orthonormal basis for 7i, and let £> Et d and P^ be as above. Let 
{fj}jL\ C W be an orthonormal set contained in fi £) ^. For each k = 1, . . . , N, j = 1, . . . , d, 

d 

let a k ,j = (fk,ipj) and write F d fk ■= ^ Qfc.jV'j so that \\f k -~P d f k \\ H < e. 

i=i 

Write v k = (afc i, . . . , a ki d) G then, for 7^ / 

<U fc , V,) = (P d /fe, P d /i) = (Pd/fc -f k + fk, Pd/z ~fi + fi) 

= (Pd/fc - /*, Pd/z - fi) + (Pd/fc - fk, fi) + </*, Pd/z - fl) 

= (Prf/fc — /fc> Pd/z — /z) + (Pd/fc — /fc, // — Pd/i) + (/it — Pd/fc, Pd/z — /z) 

(3.10) = (fk-Vdfk^dfi- fi) 

since Pd/fc — //« is orthogonal to Pd/z- It follows from the Cauchy-Schwarz inequality that 
\{v k ,vi)\ < e 2 . 

On the other hand, 

[NIk* = W^dfkWn = (ll/fellw " ll/fc " ^dfk\\ 2 H ) 1/2 > (1 - £ 2 ) 1/2 - 
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It follows that w k = 77— r, — satisfies, for k^l, \ (w k ,u>i)\ = jl'H - < -p^, and {w k }k-i 

\\Vk\\vd \\ v k\\ K d\\vi\\ K d 

is a spherical [—a, a]-code in K. d . □ 



Note that the proof only uses orthogonality in a mild way. Namely, if instead {fj}jLi C TL 
with = 1 satisfies \{fj,fk)\ < ff f° r 3 7^ ^, then Equation (|3.1Uj) becomes (vk,vi) = 

{fk ~ Wdfk,Wdfl ~ fi) + (fk, fi), so that \(v k ,vi}\ < e 2 + ?? 2 , and the end of the proof shows 

that iV< i\r«fi-±iL,d 

In view of Proposition 13.41 it is natural ask the following question. Given o — -. £ _ 3 , 
is there a converse inequality of the form N^(a,d) < CA^(e',d!) with C > an absolute 
constant and e < e' < Ce, d < d! < Cd? Note that for e such that a < 1/d, we have 
A K (e,cf) = Nl(a,d) = d. 

3.3. Prolate spheroidal wave functions. In order to obtain quantitative versions of Sha- 
piro's theorems, we will make use of the prolate spheroidal wave functions. For a detailed 
presentation on prolate spheroidal wave functions see |SP1 ILP11 ILP2j . 

Fix T, > and let {ip n }^Lo be the associated prolate spheroidal wave functions, as defined 
in |SP| . {ip n }%Lo 1S an orthonormal basis for PWq, = {/ G L 2 (R) : supp/ C [— Q,fi]}, and 
the ip n are eigenfunctions of the differential operator 

As in the previous section, for an integer d > 0, define ¥ d to be the projection onto the span 
of tpo, . . . , ipd-ii an( i f° r e > define 

S e , d = {/ G L 2 (R) : \\f\\ 2 = 1, ||/ - P d /|| 2 < e}. 

For the remainder of the paper, the orthonormal basis used in the definitions of S e di IP<2> 
and A^(e, d), will always be chosen as the prolate spheroidal wavefunctions. Note that these 
quantities depend on the choice of T, f2. 
Finally, let 

VT,n,e = \f£L 2 (R): [ \f(t)\ 2 dt<e 2 and / |/(£)| 2 d£ < e 2 1 

[ J\t\>T J\(\>Q J 

and T>T,e = Vt,T,z- 

Theorem 3.5 (Landau- Pollak |LP2j ). Let T, e be positive constants and let d = [4TJ1J + 1. 

Then, for every f G V T ,n,e, 

\\f-F d f\\ 2 2 <A9e 2 \\f\\ 2 2 . 
In other words, V T ,n,e D {/ G L 2 (R) : ||/|| 2 = 1} C S 7Eyd . 

It follows that the first d = 4T 2 +1 elements of the prolate spheroidal basis well approximate 
Vt,si an d that Vr,e is "essentially" (f-dimensional. 
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3.4. Generalized means and dispersions. As an application of the results on prolate 
spheroidal wavefunctions and spherical codes, we shall address a more general version of the 
mean-dispersion theorem. 

Consider the following generalized means and variances. For p > 1 and / £ L 2 (M) with 
1 2 = 1, we define the following associated p-variance 



A2(/) = inf| \t-anf(t)\ 2 dt. 



One can show that the infimum is actually a minimum and is attained for a unique a £ 
that we call the p-mean 



Hp(f) = arg min aeK j \t - a| p |/(i)| 2 dt. 



As before, define the p-dispersion A p (f) = yA 2 (/). 

The proof of the Mean-Dispersion Theorem for p = 2 via the Rayleigh-Ritz technique 
relied on the special relation (|2.3|) of means and dispersions with the Hermite operator. In 
general, beyond the case p = 2, such simple relations are not present and the techniques of 
Section |21 are not so easily applicable. However, we shall show how to use the combinatorial 
techniques from the beginning of this section to obtain a quantitative version of Theorem ll.3l 
for generalized means and dispersions. 

The following lemma is a modification of [El Lemma 1]. 

Lemma 3.6. Let A > and p > 1. Suppose g E L 2 (R), \\g\\ 2 = 1 satisfies 

K(s)l> A p (g)<A. 

Fix e > 0, then g G V A+ ( A /e) 2 /p,e- 

This gives a simple proof of a strengthened version of Shapiro's Mean-Dispersion Theorem: 

Corollary 3.7. Let < A, 1 < p < oo, < e < l/7y/2, a = 49e 2 /(l - 49e 2 ) ; and set 
d= [4(A+(A/e) 2 /P) 2 \ +1. 

If {ek}k = i C L 2 (R) is an orthonormal set such that for all 1 < k < N, 

|/W P (efc)|, \lip{e k )\, A p (e fc ), A p (e k ) < A, 

then 

N<Ni.(a,d)<Nl(a,2d). 

Proof. According to Lemma 13. 6| e\,...,e n are in Va+^a/e^/p e- r ^ ne definition of d and 
Theorem 13.51 show that {ej}^ =1 C Q7 £ ,d- According to Proposition 13.41 N < Ac(7e,d) < 
N£(a,d) < N^(a,2d), where a = 49e 2 /(l - 49e 2 ). □ 

This approach does not, in general, give sharp results. For example, in the case p = 2 the 
bound obtained by Corollary 13.71 is not as good as the one given in Section 2. To see this, 
assume that p = 2 and A > 1. Then 4A 2 (l + 1/e) 2 < d < 5A 2 (l + l/e) 2 . In order to apply 
the Delsarte, Goethals, Seidal bound (|3.9j) we will now chose e so that a < which will 
then give that n < 4d. Our aim is thus to take d is as small as possible by chosing e as large 
as possible. 
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For this, let us first take e < 1/50 so that a < 50e 2 . It is then enough that 50e 2 < 

1 1 1 i 1 

that is e 2 + e — — ®- We ma y thus * a ^ e £ = — ■ Note that, as A > 1, we get that 

/i+j x 

e < V 2 5 ° < 1/50. This then gives 



n < 20d < 20A 2 I 1 + 



2 | = 20,4 2 fl + 100^( A /l + -^+l)^ <CA 4 



In particular, the combinatorial methods allow one to take N = CA 4 in Corollarv l3.7l whereas 
the sharp methods of Section 12.31 give N = 8irA 2 , see Corollary 12.51 

3.5. The Quantitative Umbrella Theorem. A second application of our method is a 
quantitative form of Shapiro's umbrella theorem. As with the mean-dispersion theorem, 
Shapiro's proof does not provide a bound on the number of elements in the sequence. As 
before, the combinatorial approach is well suited to this setting whereas the approach of 
Section [2 is not easily applicable. 
Given / e L 2 (R) and e > 0, define 



C / ( £ )=inf{r>0 : f \f(t)\ 

{ J\t\>T 



2 <e*\\f\\l 



Note that if / is not identically zero then for all < e < 1 one has < C/(e) < oo. 

Theorem 3.8. Let cp,tp € L 2 (R) and M = min{||</?|| 2 , ||V>|| 2 } > 1. Fix > e > 0, 
T > max{C </ ,(e),C^(e)} ; and d = [4T 2 J + 1. 

If {e n }n = i is an orthonormal sequence in L 2 (R) such that for all 1 < n < JV, and for 
almost all 

(3.11) \e n {x)\ < \(p{x)\ and |e n (£)| < |^(£)|, 
then 

(3.12) N < Ag(50e 2 M 2 , d) < iV|(50e 2 M 2 , 2d). 

In particular, N is bounded by an absolute constant depending only on (p and ip. 

Proof. By (|3.11|) . T > maxjC^e), C^(e)}, implies {e n }^ =l C Vt,eM- According to Theorem 
13.51 Vt,eM C $>7 £ M,d- It now follows from Proposition 13.41 that 

(A9e 2 M 2 \ 
l-49e 2 M* ,d ) ~ ^(SO^M 2 ^) < N^(50e 2 M 2 ,2d). 

□ 



Let us give two applications where one may get an explicit upper bound by making a 
proper choice of e in the proof above. 
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Proposition 3.9. Let 1/2 < p and < C be fixed. If {e n }^ =1 C L 2 (M) is an orthonor- 

mal set such that for all 1 < n < N, and for almost every 

\e n (x)\ < . C , 7T- and \e n (£)\ < 



(i+\x\)p 1 - 

then 



N < I 



o,\V^=i) if 1/2 <p, 



2 



4(f?)^ tf3/2<„. 



Proof. If (p(x) = h^ x \\p ; then M = ||v?||2 = Cw^j > 1, and a computation for < e < 1 



shows that C^(e) = e2/( 2 P _i) - 1- Let 5 = 5(e) = g4/( 2 P -i) and a = a(e) = . Taking 

T = C v (e) implies that d = [4T 2 \ + 1 < 5{e). 

If < e < g^j, then Theorem 13.81 gives the bound N < iVg (a(e), 5(e)). We shall chose e 
differently for the various cases. 

Case 1. For the case 1/2 < p, take e = t^j, and use the exponential bound given by 
Lemma I3~2l for N^(a(e), 5(e)) to obtain the desired estimate. 

Case 2. For the case 1 < p < 3/2, let eo = ( ^og 1 ) ^ > a = a ( e o)) and 5 = 5(eo). Note 
that a = < and also that eo < , since 1 < p < 3/2. Thus the bound ()3.9j) yields 
iV < iV| (a, 25) = 4(1 - a 2 )5 < 45. The desired estimate follows. 

2p-l 

Case 3. For the case 3/2 < p, define ei — y 50CV2 ) and note that ei < scIm - Since 
3/2 < p, taking e < e\, a = a(e), 5 = 5(e), implies that a(e) < l/5(e). Thus, by Lemma l3~Tl 
N < 5(e) for all e < £\. Hence, N < 5(ei), and the desired estimate follows. 

□ 

Note that in the case 1/2 < p, the upper bound in Proposition 13.91 approaches infinity 
as p approaches 1/2. Indeed, we refer the reader to the counterexamples for p < 1/2 in 
DSSTj |By| . The case p = 1/2 seems to be open as |DSST| need an extra logarithmic factor 

in their construction. For perspective in the case 3/2 < p, if one takes C = C p = \J^j^-, 
then the upper bound in Proposition 13.91 approaches 4 as p approaches infinity. 



Proposition 3.10. Let < a < 1 and (2a) 1 / 4 < C be fixed. If {e n }% =0 C L 2 
orthonormal set such that for all n and for almost every x,£ E R 

K(x)\ < Ce~™W 2 and \e n (0\ < Ce~™^ 2 , 

then 

N < 2 + — max ^ 2 hi ^ ,ln — — \. 

- cm I V « 1/4 / \ a 5 /2 e 2^ J / 



^s an 
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Proof. Let j a (x) = Ce - ™^ 2 and let C a {e) = C 7a (e). First note that 

\t\>T J\t\>T V a JTy/E + S 

CMl + aT 2 ) _ 2waT2 
/5 ' 



while M = H7JI2 = (JiRC 2e 27r °l*l 2 dt) 1 / 2 = t^TS • particular, ||7 a || 2 > 1. Now for every 

T > 0, set e(T) = 2~ 1 / 4 ^ v / l + aT 2 e"™ T2 , so that C a {e{T)) < T. 

By Theorem l!nn we get that N < iV|(50e 2 (T)M 2 , 8T 2 + 2), provided e(T) < 5^7. Let us 
first see what condition should be imposed on T to have e(T) < -gjjjj- Setting s = (1 + aT 2 ), 
this condition is equivalent to ^e~ ns < e 50( ^J^ ■ Thus, it suffices to take s > ^ In ^ 50 ^ v ^f e ^ , 
and T 2 > A i n ( socwy 

— a7r ^ a 1 / 4 / 

We will now further choose T large enough to have 50e(T) 2 M 2 < 8r ^ +2 , so that Lemma 
IQ will imply N < iV|(50e(T) 2 M 2 , 8T 2 + 2) = 8T 2 + 2. This time, the condition reads 
(1 + aT 2 )(l + 4T 2 )e- 2 ™ T2 < 5^7. Let r = a(4T 2 + 1). Thus, it suffices to take r 2 e~% r < 
50nC 2 e ^a/2 « is enough to take r > - in ^ a5/2e 2^ J , and J > — in ^ a5/ 2 e 2n 

Combining the bounds for T 2 from the previous two paragraphs yields 



N 



< 2 + — max ^ 2 In ^- , In — — k 

a-n I V « 1/4 / V a 5 /2 e 2vr I / 



□ 



A careful reading of the proof of the Umbrella Theorem shows the following: 



Proposition 3.11. Let < C, and let 1 < p, q,p,q < oo satisfy — I — = 1 and — + — = 1. 

p q p q 

Let ip £ L 2p (R) and if; £ L 2p (R), and suppose that ip k G L 2q (R) and ip k £ L 2q (R) satisfy 
WfkWzq — C, \\1pkW2q — C- There exists N such that, if {e^} C L 2 (IR) zs an orthonormal set 
which for all k and almost every i,(£R satisfies 

\ek(x)\ < Vk(x) tp(x) and \e k (£)\ < ipk{0 

then {e/j} /10s ai most N elements. As with previous results, a bound for N can be obtained 
in terms of spherical codes. The bound for N depends only on ip,i/),C. 

Indeed, let e > and take T > big enough to have / \f(t)\ 2p dt < e p /C 2/p . Then 

J\t\>T 

|e fc (t)| 2 di < / \<p k (t)<p(t)\ 2 dt < ( f \Mt)\ 2q dt) If Mt)\ 2p dt) 



'\t\>T J\t\>T \J\t\>T J \J\t\>T J 

< C 2 (e p /C 2 / p ) 1 / p = e. 
A similar estimate holds for ej~ and we conclude as in the proof of the Umbrella Theorem. 
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3.6. Angles in Riesz bases. Let us now conclude this section with a few remarks on Riesz 
bases. Let {xk}^L be a Riesz basis for L 2 (M) with orthogonalizer U and recall that, for 
every sequence {a n }^L £ £ 2 , 

2 rvn 



(3.13) 



aJ 2 < 



n=0 



n=0 



-111 2 



n=0 



Taking a n = 5 n ^ in (|3.13|) shows that p^- < Hx/d^ < Then taking a n = S n) f.+XS n> i, 

k 7^ I and A = t, —t, t > gives 



r(l+t 2 )< 



+ r 



+ 2t\Re{x k ,xi)\ < IIC/- 1 !! (1 + t 2 ) 



thus \Re{xk,X[) | is 



< min((||x fc || 2 -||[/| r 2 )( || Xz ||2 



< 



jll^Himin 



1 



1 



\u- 



)(II^ 1 I 



i 



\\u\\ 2 \\u-i\\\ 

while taking A = it, —it, t > gives the same bound for \Im(xk, xi)\ 2 . It follows that 
(3.14) 
where 

1 



\(x k ,xi)\<C(U)\\x k \\ 2 \\xi\\ 2 <C(V)\\U 

' fWu- 1 



-HI 2 



C(U) := \/2min 



1 



1 



Mu\\\\u-±\ 

We may now adapt the proof of Proposition 13.41 to Riesz basis: 

Proposition 3.12. Let {ipk)kLi be an orthonormal basis for L 2 (M). Fix d > and Zef P^ be 

the projection on the span of {V>i, • • • , tpd—l}- 

Let {xk}fcL± be a Riesz basis for L 2 (W) and let U be its orthogonalizer. Let e > be such 

-^ IWr'-W- HT) a nd let 



that e < min 



(3.15) 



o 



e 2 + C(C/)||C/- 1 || 2 



" 2 - £ 2 



// {xk}k =1 satisfies \\xk — PdXfc|| 2 < e i/ien ./V < JVj|(a, d). 

Proof. Assume without loss of generality that xq,...,xn satisfy \\xf~ — P^a^H < e and let 
Write vt = (fflfc,i, • • • , £ ^ then, the same computation as in ()3.1U|) . for 7^ / 

= {Xk ~ PdXk,~PdXl - Xl) + (Xk,Xl) 



thus < e 2 + On the other hand 

hw = Wdx k \\ = {\\x k \\ 2 - \\ Xk - PrfAf) 1 / 2 > 



e2) l/2 



1(5 
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It follows from (|3.14[) that Wk = j, — ry satisfies, for k^l, 



\Vk\ 

\(w k ,wi}\ < 



e 2 + C{U)\\U~ l \\ 2 
\\U\\- 2 -e 2 



and {u>fc} forms a spherical [—a, a] -code in K. d . □ 

Note that the condition on e implies that < a < 1. Also note that if U is a near 
isometry in the sense that (1 + f3)~ l < \\U\\ 2 < ||i7 _1 || 2 < 1 + P then C(U) < v^t§J§ and 

« < ffi P) ■ In Particular, if U is near enough to an isometry, meaning that (5 is small 

enough, then this a is comparable with the a of Proposition 13.41 

As a consequence, we may then easily adapt the proof of results that relied on Proposition 
13.41 to the statements about Riesz bases. For example, an Umbrella Theorem for Riesz bases 
reads as follows: 

Theorem 3.13. Let ip,tp £ L 2 (1R) with \\ip\\ 2 , \\1pW2 — 1- {fn}^Li be a Riesz basis 

for L 2 (M) with orthonormalizer U that is near enough to an isometry + < \\U\\ 2 < 
— 1 + with (5 small enough. Then there exists a constant N = N((p,ip,/3) depending 
only on <p, tp and (5, such that the number of terms of the basis that satisfies 

\fn(x)\<\ip(x)\ and \f n (0\ < {HOI 

for almost all x,t; £ M is bounded by N. As with previous results, a bound on N can be given 
in terms of spherical codes. 
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